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(54) A novel diagnostic marker for splicing variants of genes associated with neurological 
function 



(57) Methods are described for detecting the pres- 
ence or absence of a four amino acid motif (VRXQ) in 
expressed proteins that arise from aberrant alternative 
splicing of premRNA in genes associated with normal 
neurological function which are useful for detecting neu- 
rodegenerative disease. The presence of these variants 



suggest that mutational events in these genes have oc- 
curred. Methods to measure the levels of gene expres- 
sion of such genes to detect neurodegenerative disease 
are provided. Nucleotide sequences and intron-exon 
junctional sequences of examples of this splicing variant 
and probes for detecting this variant which are useful as 
diagnostic reagents are also provided. 
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Description 

BACKGROUND OF THE INVENTION 

s In eukaryotes, the initial transcription of genomic DNA into RNA proceeds in the nucleus and yields a contiguous 

full-length reverse connplementary heteronuclear RNA (hnRNA) primary transcript. The hnRNA contains regions or 
contiguous blocks of nucleotide sequence that end up in the final mRNA (axons) interspersed between "intervening" 
nucleotide sequences (introns) that do not. In addition to adenylyl methylation and polyadenylation, these hnRNAs are 
extensively modified in a process referred to as RNA "splicing" wherein discontiguous exons are joined and the inter- 

10 vening intron precisely deleted as an RNA "lariat" from the final mature mRNA transcript (B. Rushkin et al. Cell 1 984, 
38:317; R.A. Padgett et al. Science 1984, 225:898). RNA splicing is a complex process involving large protein-RNA 
assemblies called spliceosomes that coordinate the concerted excision and ligation events to yield intron-free mRNAs 
(M.M. KonarskaandP.A. Sharp CelM987, 49:763; R. Reid et al. CelM988. 53:949; TA. Steitz Sci. Am. 1988, 258:56). 
In normal RNA processing, the resultant mRNA reflects the linear sequence orientation of the exons in the hnRNA: 

'5 however all exons do not end up in the final transcripts. Rather, several of the resultant mRNAs have only certain exons 
that result from "alternatively spliced" hnRNA, wherein discontiguous intron-exon junctions are spliced to bring for 
instance exoni and exon 4 into juxtaposition rather than exon1 and exon 2. Therefore, several mRNAs may arise from 
one gene sequence or hnRNA. Not all possible combinations of exons are normally represented in actual mRNA pools 
arising from one hnRNA as determined by mRNA, cDNA and protein analyses. As an example with three exons (Figure 

20 1), while seven combinations are possible (exoni -exon2-exon3, exon1-exon2, exoni -exon3, exon2-exon3, exonT 
exon2, or exon3) perhaps only two (exoni -exon2-exon3 and exoni -exon3) may actually result and be expressed at 
any appreciable level. These alternatively spliced transcripts are sometimes referred to as "variants". However for 
purposes of this invention splice "variant" refers to heretofore unrepresented or expressed mRNAs arising from potential 
alternative splice sites that result from genomic mutation altering the structure of the hnRNA so that these splices now 

25 occur. 

The location of splice sites in an hnRNA primary transcript can be determined by comparing the sequences of the 
corresponding genomic DNA with that of cDNA prepared by copying the corresponding mature mRNA. Any disconti- 
nuities between the genomic DNA and cDNA sequences mark the exon-intron boundaries. Such analyses of a number 
of different RNAs have defined moderately -short "consensus" sequences at the intron-exon boundaries in pre-mRNA 

30 and a tendency for a pyrimidine-rich region just upstream of the 3' splice junction (Figure 2). The only universally 
conserved nucleotides are the first two (GU) and last two (AG) in the intron (Figure 2), though there is a propensity for 
AG at the 5' exon termini and an initial G at the 3' exon. Only 30-40 nucleotides in the center portion of introns are 
necessary for efficient splicing. There is also a conserved A within the context of the pynmidine rich region of the intron 
(Figure 2) (..PyrPyrPurAPyrnAG: where Pyr is a pyrimidine and Pur is a purine nucleotide) which is the branch point 

^5 where the cleaved 5' exon-intron junction loops back to form the "lariat" splicing intermediate (Padgett et al. Science 
1984, 225:898). Genetic point mutations that delete or alter these conserved intronic nucleotides (5' GU. 3' AG. or 
branch point A) would eliminate these splice junctions and prevent normal splicing yielding aberrantly truncated tran- 
scripts or transcripts where this exon is deleted and another downstream exon spliced in, that normally may not be 
spliced in. 

40 A final mechanism for splice variation occurs when several GU or AG dinucleotide motifs occur near consensus 

intron splice regions of 5' exon-intron or 3' intron-exon boundaries, respectively, such that the splicing system may 
sometimes not correctly distinguish the correct splice site resulting in alternate protein product some of which may be 
non-functional or aberrant. 

Multiple examples of splice variations exist, many of which are associated with diseases or related disorders. 

45 Previous genetic linkage studies have shown a G to A mutation at the 3' splice junction of exon 8 of the gene encoding 
lysosomal acid lipase. Defects in this gene are associated with cholesterol ester storage disease that result in premature 
artherosclerosis, hepatomegaly, and elevated LDL cholesterol (U. Seedorf etal. Arterioscler Throb. Vase. Biol. 1995, 
15: 773-778). Two mutations at the exon l/intron 1 boundary altered the hepatic specific splicing of the human hy- 
droxymethylbilane synthase gene (third enzyme in heme biosynthetic pathway) and resulted in an enzyme with haif- 

50 normal activity (K,H, Astrin Human Mutat. 1994, 4:243-252). Deficiency of this enzyme activity eventually results in 
acute intermittent porphyria (AlP), an autosomal dominant inborn error of metabolism in which life-threatening attacks 
are precipitated by ecogenetic factors. Molecular cloning of cDNA and genomic DNA have provided probes allowing 
presymptomatic detection of these gene defects. In Menke's disease, a point mutation at the - 2 exonic position of a 
splice donor site in the middle of the gene causes exon-skipping and activation of a cryptic splice acceptor site (S.G. 

55 Kaler et al. Nat. Genet. 1994, 8:195-202). Exon skipping of the entire exon 19 results from a G to A point mutation at 
the 5' donor site of intron 1 9 in muscle phosphofructokinase deficiency (T Hamaguchi Biochem. Biophys. Res. Comm. 
1994, 202:444-449). Aberrant RNA splicing from a splice site mutant in the interleukin-2 receptor gamma (glL2-R) 
gene results in the generation of an abundant non-functional glL2-R containing a small intronic insertion and a second 



2 



BNSDOCI0;<EP 0791660A1? 



EP 0 791 660 A1 



mutant form with 5-fold lower affinity (J. R DiSanto et al. Proc. Natl. Acad. Sci. 1994, 91:9466-9470). These tsoforms 
produce an atypical form of an X chromosome-linked severe combined immunodeficiency disease. 

The presence of splice variants can be used as diagnostic markers of diseases associated with genetic mutations. 
For example, the expression of the exon 6 splice variant (v6) of the ceil adhesion molecule CD44 is correlated with 

5 the expression of the tumor suppressor gene p53. Both have been shown to be markers of tumor progression in 
colorectal cancer (J.W. Mulder et al. Gut, 1995, 36:76-80; Y. Matsumura Lancet 1992, 340:1053-1058). Asymptomatic 
carriers of the acute intermittent porphyria were identified by identification of a mutant allele containing a CG to CT 
transversion at the exonl/intron 1 boundary via in vitro amplification of DMA followed by hybridization of the target 
sequence to allele-specific oligonucleotides. 

10 Accordingly, splicing variants have been observed in several gene loci and several diseases. Identification of these 

variants has proven to be especially useful in diagnosis and detection of asymptomatic carriers. 



SUMMARY OF THE INVENTION 



^5 A novel insertional motif that anses from splice mutations or alternative utilization of cryptic or less preferred splice 

donor sites has now been identified. These splicing variations result in the in-frame insertion within a normal protein 
sequence of four amino acids, valine-arginine-X-glutamine (VRXQ), where X is a hydrophilic amino acid. This motif 
has been identified in splice variants of a receptor, an enzyme, and a putative channel protein, all of which are involved 
in normal neurological functioning. Identification of this motif allows for screening of genes and gene products for splice 

20 variations. 

A method for the detection of this motif in expressed proteins in vitro or in situ with the use of specific antisera, 
polyclonal or monoclonal antibodies is provided. A method for the detection of aliele-specific genetic mutations using 
selected oligonucleotides with standard hybridization-based detection techniques is also provided. A method for diag- 
nosing Alzheimer's Disease (AD) by detecting differences in levels of transcripts having the VRXQ insertion or proteins 
25 encoded therefrom is further provided. A preferred embodiment of such method for detecting AD provides for the 
detection of Familial Adult Onset Alzheimer's Disease (FAD). 



BRIEF DESCRIPTION OF THE FIGURES 



30 Figure 1 is a schematic of potential alternative splicing with 3 exons and 4 introns. 

Figure 2 is a schematic of the consensus exon-intron-exon structure and sequence. 

Figure 3 provides the sequence of the VRSQ variant of the presenilin 1 gene. SEQ ID NOS: 1 - 2 

Figure 4 provides PS-1 Oligonucleotide Probes. SEQ ID NOS: 3 - 5 

Figure 5 provides tabulated results of quantification of the ISH signal for PS-1 -long and PS-1 -short mRNAs in 
55 human brain. 

DETAILED DESCRIPTION OF THE INVENTION 



The presenilin 1 (herein "PS-1") gene encodes a neuropeptide predicted to be a classical seven transmembrane 
protein (Sherrington et al. Nature 1 995, 375:754-760). IVlissense mutations within this gene have been found in several 
families exhibiting early-onset Alzheimer's disease. Genomic analysis has revealed the intron-exon boundaries of the 
hnRNA. A common polymorphism located within the intron 3' to exon 9 was identified in early onset AD patients. This 
polymorphism also showed a strong association with the occurrence of typical late onset AD families. This particular 
mutation did not produce an alteration in the coding sequence but is typical of variations leading to alternatively spliced 
^5 proteins. 

Other mutations within different introns of the PS-1 gene have been identified. These lead to alternatively spliced 
variants as well. One novel variant of the PS-1 protein isolated from a human cerebellar cDNA library contains a four 
amino acid insertion between codons 26 and 27 (VRSQ) (Figure 3). This variant arises from alternative use of a 5' 
exon donor site in the exon 3/intron 3 boundary and results in the loss of some potential phosphorylation sites. A similar 
^0 motif (VRXQ- where X is a hydrophilic amino acid) arising from aberrant splicing has also arisen due to alternative 
splicing in several other neurological proteins as well. 

For example, the mRN A for tyrosine hydroxylase, the rate limiting enzyme in the synthesis of catecholomines, can 
undergo alternative splicing to produce several different isoforms (Kobayashi et al. J. Biochem. 1988, 103(6) 907-12; 
Lewis et al. Neuroscience 1993. 54(2) 477-92). The identified variants contain a 12 bp insertion encoding the sequence 
VRGQ. Isoforms containing the VRGQ insertion have also been found to exhibit alterations in phosphorylation by MAP 
kinase (Sutherland et al. Eur J Biochem. 1993, 217(2)715-22). Furthermore, a tyrosine hydroxylase variant containing 
this insertion has been implicated in Parkinson's disease. 

Another neuropeptide, gamma-Aminobutyric acid A (GABAA) receptor, undergoes alternative splicing to yield a 
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multiplicity of transcripts (Whiting et al. P.N,A.S. 1990, 87(24)9966-70; Lashametal. Biochem. Soc, Trans. 1991, 19 
(1 ) 9S). GABA receptors are multisubunit ligand gated ion channels which mediate neuronal inhibition by GABAA and 
are composed of at least four subunit types (alpha, beta, gamma, and delta). The beta 4 subunit can undergo alternative 
splicing at two 5'-donor splice sites separated by 12 bp in the region that encodes the presumed intracellular loop 

5 between transmembrane domains M3 and M4. The insertion of the 12 bp sequence results in the addition of a VREQ 
motif (Bateson et al. J. Neurochem 1991, 56(4) 1437-40). 

In all three neurological proteins, the alternative splice site generates variants containing a specific motif (VRXQ) 
which appears to be intracellularly located and alters phosphorylation by various kinases. 

In the present invention, a method for detecting the presence of the VRXQ motif in polyadenylated messenger 

^0 RNA transcripts (polyA mRNA) and resultant expressed proteins, (where V is valine, R is arginine, X is any hydrophilic 
amino acid residue, and Q is glutamine) or in cDNA resulting from these RNAs is provided. A method for quantitating 
such transcripts encoding and proteins having a VRXQ motif are also provided. Oligonucleotides having the anticodon 
sequences associated with the VRXQ motif having degenerate positions at the third base position of each codon can 
be used for the detection and quantitation of mRNA. Additionally these oligonucleotides can be associated with codon 

/5 sequences and used for the detection of cDNAs, and quantitation of the transcript from which the cDNA was derived. 
For example, codon and anticodon oligonucleotides for VRNQ comprise GU(N) AG(A/G) AA(C/U) CA(AyG) and the 
reverse complement. Hybridization of appropriate oligonucleotides can be detected and quantitated directly by proce- 
dures well known to those of skill in the art using radioactively or fluorescently labeled oligonucleotides, Indirect de- 
tection and quantitation procedures such as, but not limited to, biotinylated oligonucleotides/strepavidin-horseradish 

20 peroxidase, enhanced chemiluminescent detection, or fluorescently tagged strepavidins can also be performed. 

Specific antibodies against the VRXQ motif can also be used for detection of the motif and quantitation of proteins 
having the motif. Various procedures known in the art may be used for the production of such antibodies. 

For example, these antibodies can be obtained by direct injection of a polypeptide containing a VRXQ motif into 
an animal, preferably a nonhuman. The antibody so obtained will then bind to polypeptides containing this motif. Such 

25 antibodies can then be used to isolate and quantitate polypeptides containing this motif from tissues. 

For preparation of monoclonal antibodies, any technique which provides antibodies produced by continuous cell 
line cultures can be used. Examples include the hybridoma technique (Kohlerand Milstein, Nature 1975, 256:495-497), 
the trioma technique, the human B-cell hybridoma technique (Kozbor et al.. Immunology Today 1983, 4:72), and the 
EBV-hybridoma technique to produce human monoclonal antibodies (Cole et al. in Monoclonal Antibodies and Cancer 

30 Therapy Alan R. Liss, Inc., 1965, pp. 77-96). 

Techniques described for the production of single chain antibodies (U.S. Patent 4,946,778) can be adapted to 
produce single chain antibodies to the immunogenic motif of this invention. Also, transgenic mice may be used to 
express humanized antibodies to polypeptides containing this motif. 

Primary antibody-antigen reactions can be visualized and quantitated secondarily by standard enzyme-linked im- 

35 munosorbent assay (ELISA) procedures. An ELISA assay initially comprises preparing an antibody specific to a VRXQ 
motif, preferably a monoclonal antibody In addition a reporter antibody is prepared against the monoclonal antibody 
To the reporter antibody is attached a detectable reagent such as horse radish peroxidase. A sample is then removed 
from a host and incubated on a solid support, e.g., a polystyrene dish, that binds the proteins in the sample. Any free 
protein binding sites on the dish are then covered by incubating with a non-specific protein like BSA. Next, the mono- 

^0 clonal antibody is incubated in the dish during which time the monoclonal antibodies attach to any proteins containing 
the VRXQ motif attached to the polystyrene dish. All unbound monoclonal antibody is washed out with buffer. The 
reporter antibody linked to horseradish peroxidase is then placed in the dish resulting in binding of the reporter antibody 
to any monoclonal antibody bound to proteins containing the VRXQ motif. Unattached reporter antibody is then washed 
out. Peroxidase substrates are then added to the dish and the amount of color developed in a given time period is a 

-^5 measurement of the amount of protein containing the VRXQ motif present in a given volume of patient sample when 
compared against a standard curve to detect and quantitate the protein. Examples of other detectable reagents which 
can be used include, but are not limited to, luciferase and fluorescently or radioactively tagged secondary antibodies. 
Specific populations of immune cells or chimeric cells (e.g., hybridomas) that express antibodies to VRXQ epitopes 
on their celt surfaces and respond by degranulation or release of cellular contents such as histamines that can be 

so detected functionally or preloaded radiolabeled metals such as chromium are also useful. 

Embodiments of the invention can be used to detect alterations in and make comparisons between expression in 
of PS-1 variants in presumptive neurodegenerative disease, particularly neurodegenerative disease associated with 
head injury and AD, and more particularly chromosome 14 FAD. In a particularly preferred embodiment, probes and 
methods of the invention can be used to detect a reduction in the expression of PS-1 transcript encoding the VRSQ 

55 motif, shown by this invention to be a diagnostic marker for chromosome 1 4 FAD, since lowered levels are associated 
with chromosome 14 FAD. Preferred embodiments of the invention provide for comparisons between variants com- 
prising the VRSQ region with those lacking it enabling the diagnosis of AD, particularly chromosome 14 FAD. 

The methods of the invention to detect and quantitate PS-1 polynucleotide sequence, PS-1 expression levels and 
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gene expression products, particularly the imnnunological methods and methods using oligonucleotides, can be used 
with bodily tissues and fluids from individuals. Preferred bodily tissues and fluids useful with the methods of the invention 
include, but are not limited to, blood cells, plasma, skin cells, and brain cells, particularly neuronal, glial, and astrocyte 
cells. 

5 The following examples are provided for illustrative purposes only and are not intended to limit the invention. 

EXAMPLES 
Example 1 

10 

A novel splice s/ananX of the PS-1 gene described by Sherrington et al. Nature 1995, 375:754-760, was isolated 
from a human cerebellar and a human fibroblast library. In this novel splice variant there is a deletion of four amino 
acids at codons 26-27 (VRSQ). This arises from alternative use of a 5' exon donor site in the exonS/intron 3 (-52 to 75 
nt) boundary. The ...CAG/gta... boundary of the final Gin codon of exon 3 of the VRSQ motif provides a 5' exon AG 
^5 donor site and GT intron consensus 5' boundary and use of this splice site results in the insertion of the 12-nts encoding 
the VF^SQ motif. The upstream ...ACT/GTA... boundary of the Thr-Val codons provides the less preferred CT (AG 
preferred) 5' exonic boundary to the consensus GT 5' intronic boundary and splicing at this site would remove the 
VRSQ motif. Interestingly, in the PS-1 protein of Sherrington et al. Nature 1995, 375:754-760, this is the sole observed 
product and point mutations are interspersed elsewhere. 

20 

Example 2 

In the GABA receptor 4 subunit alternative splicing adds a VREQ motif (Bateson et al. J. Neurochem 1 991 , 56(4) 
1 437-40). A chicken genomic cDN A library was screened with chicken beta- 4' subunit cDN A at high stringency. South- 

25 ern blot analysis, using cDNA sequence specific oligonucleotides as probes and subsequent restriction mapping al- 
lowed the identification of overlapping DNA fragments containing the coding regions of the beta-4 subunit gene. These 
fragments were subcloned into pBluescript and sequenced. Complete sequencing of one of the clones revealed the 
presence of 12 bp in the part encoding the intracellular loop (amino acid residues 335-338). Analysis of the beta-4 
subunit gene reveals that the different transcripts encoding the two vanants (absence or presence of 1 2bp loop) arise 

00 by the use of one of two 5' -donor splice sites (located in the intron immediately 3' of the 12 bp sequence). 

Example 3 

The expression of two PS-1 mRNA transcripts, one containing (herein " PS-1 -long ") and one lacking the VSRQ 
55 motif (herein "PS- 1 -short"), in the brains of patients with early onset FAD was analyzed. In situ hybridization (ISH) was 
used to determine the qualitative and quantitative pattern of expression of PS-1 mRNA in the brains of early onset 
(presumptive chromosome 14-linked) FAD cases; comparisons with brains from patients with late onset AD and from 
normal individuals were made. 

40 In Situ Hybridization 

PS-1 mRNA expression was examined in 4 neurologically normal control cases, 6 late onset AD cases and 3 early 
onset FAD cases. The late onset cases were thought to be of a sporadic nature as there was no evidence of family 
history and the mean age at death was 81 .2 years (range: 79-84 years); they had a mean post mortem delay of 8.3 
hours. The early onset FAD cases were presumed to be linked to chromosome 14 as they all had onset ages, family 
history, clinical presentations and histopathology typical of chromosome 14-tinked FAD. For these the mean age at 
death was 45 years (range: 44-46 years) and the mean post mortem delay was 41 .7 hours. All AD cases were diagnosed 
according to standard pathological criteria (Khachaturian, 1985, Archives of Neurology. 42:1097-1105). The controls 
had a mean age at death of 68.8 years (range: 57-85 years) and mean post mortem delay of 11.8 hours. The brain 
50 regions examined were the hippocampus, temporal cortex and frontal cortex (regions severely affected by AD pathol- 
ogy), the visual cortex (an area relatively unaffected, but which at the time of death may be in the early stages of the 
disease process) and the cerebellum (an area not affected by the classic pathology associated with AD and with no 
clinical involvement). 

Three different oligoprobes were chosen and synthesized (Figure 4): one to detect PS-1 -long, one to PS-1 -short 
55 and one that recognizes both transcripts, PS-1 -both, These probes are not predicted to detect the transcripts of prese- 
nilin-2, a closely related gene on chromosome 1 (Rogaev, et al., 1995, Nature 376:775-78). 

The ISH methodology is well known in the art and has been described in detail elsewhere (Najlerahim et al., 1990, 
FEBS Letters 7:317-333). For the ISH analyses 10(m cryostat tissue sections were used. Probes were labelled at their 
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3' end with ^^S-dATP using the NEN DuPont 3' end labelling system. Hybridization and wash tennperatures for the 
various probes are given in Figure 4. Hybridized sections were apposed to tritiunn-sensitive film for the generation of 
autoradiographs. Hybridization with the PS-1 probes in the sense orientation on adjacent sections were used to control 
for non-specific background. The signal on autoradiographic film was quantified using an image analyzer (Seescan®). 
5 A representative area over most of a tissue section was measured: for example, in the hippocampus the different 
subfields were not separately quantified. The background signal (sense strand hybridization) was subtracted from the 
antisense signal. Statistical analysis of the data was performed using the well known two-tailed Student's t-test. 

Northern Analysis 

10 

Northern analysis was carried out with the PS-1 -both probe on a Northern blot (Clontech®, catalogue number: 
7750-1 ) containing polyA+ mRNA from a number of different human brain regions. The probe was 3' end labelled with 
32P-dATP using terminal transferase and hybridized under standard conditions (Ciontech®, data sheet). 

15 Diagnostic Methods and Reagents for FAD 

In situ hybridization using all three probes revealed that PS-1 mRNA was present in all of the brain regions exam- 
ined. Hybridization with a sense strand control probe gave a very low background signal. In the cerebral cortex (three 
regions) a signal was detected in both the grey and white matter, often with a similar intensity. A diffuse rather than 

20 laminar pattern was observed in grey matter and in the hippocampus the different subfields were not readily delineated 
(although the dentate gyrus was sometimes visible). In the cerebellum^ the granule cell layer contained the most la- 
belling. These data are consistent with PS-1 mRNA expression in both neurons and glia. 

Northern analysis confirmed that the PS-1 -both oligoprobe detected a major transcript in human brain of the correct 
size for PS-1 mRNA (in accordance with the sequence data of Sherrington et al. 1995, Nature 375:754-760). A major 

25 band of approximately 3.4 kb was detected in all brain regions examined, indicating a wide distribution in brain for PS- 
1 mRNA. The observation of PS-1 mRNA in corpus callosum is consistent with the interpretation from ouir ISH data 
that PS-1 is expressed in glia. 

A similar anatomical pattern was seen by ISH, in each region, for both PS-1 -long and PS-1 -short transcripts. 
Nevertheless there appeared to be differences between the transcripts in their levels of expression according to brain 

^0 region: for example PS-1 -short was relatively less abundant in the cerebellum (Figure 5). 

The hybridization pattern was similar for the controls, sporadic AD and FAD cases. Quantification of the autoradi- 
ographic film revealed a statistically significant reduction in the amount of PS-1 -long mRNA in FAD hippocampus and 
frontal cortex compared with the sporadic AD cases (Figure 5: p - 0.003 and p = 0.014 respectively). In the cerebellum 
there was no significant difference between the controls, sporadic AD and FAD cases. The reduction in PS-1 -long 

36 appears to be specific because there was no change in the level of expression of PS-1 -short mRNA in any brain region 
investigated between the three different groups (Figure 5), which indicates reasonable data consistency. 



40 
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SEQUENCE LISTING 
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(i) APPLICANT: University of South Florida, Washington University and 
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(ii) TITLE OF INVENTION: A Novel Diagnostic Marker for Splicing 
Variants of Genes Associated with Neurological Function 

(iii) NUMBER OF SEQUENCES: 5 
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(A) ADDRESSEE: SmithKline Beecham Corporation 

(B) STREET: 709 Swedeland Road, P.O. Box 1539 

(C) CITY: King of Prussia 

(D) STATE: PA 

(E) COUNTRY: USA 

(F) ZIP: 19406-0939 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: DISKETTE. 3.5 INCH, 1.44 Mb 

STORAGE 

(B) COMPUTER: IBM 486 

(C) OPERATING SYSTEM: WINDOWS FOR WORKGROUPS 

(D) SOFTWARE: WORDPERFECT 5.1 

(vi) CURRENT APPLICATION DATA: 
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(B) FILING DATE: Herewith 

(C) CLASSIFICATION: 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER:60/0 12,077 
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(viii) ATTORNEY/ AGENT INFORMATION: 
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(A) TELEPHONE: 610-270-5024 
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(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1914 
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ixi) 


SEQUENCE DESCRIPTION: 


SEQ ID NO: 


: 1: 






CCGTACGTAG 


CCGCGGCGGC 


AGCGGGGCGG 


CGGGGAAGCG 


TATGCATACA 


50 


5 


AATTTATTAG 


CATGCAGACT 


GGGAGAACCA 


CAAGACCTAA 


TCTGGGAGCC 


100 




TGCAAGTGAC 


AACAGCCTTT 


GCGGTCCTTA 


GACAGCTTGG 


CCTGGAGGAG 


150 




AACACATGAA 


AGAAAGAACC 


TCAAGAGGCT 


TTGTTTTCTG 


TGAAACAGTA 


200 




TTTCTATACA 


GTTGCTCCAA 


TGACAGAGTT 


ACCTGCACCG 


TTGTCCTACT 


250 


10 


TCCAGAATGC 


ACAGATGTCT 


GAGGACAACC 


ACCTGAGCAA 


TACTAATGAC 


300 




AATAGAGAAC 


GGCAGGAGCA 


CAACGACAGA 


CGGAGCCTTG 


GCCACCCTGA 


350 




GCCATTATCT 


AATGGACGAC 


CCCAGGGTAA 


CTCCCGGCAG 


GTGGTGGAGC 


400 




AAGATGAGGA 


AGAAGATGAG 


GAGCTGACAT 


TGAAATATGG 


CGCCAAGCAT 


450 


15 


GTGATCATGC 


TCTTTGTCCC 


TGTGACTCTC 


TGCATGGTGG 


TGGTCGTGGC 


500 




TACCATTAAG 


TCAGTCAGCT 


TTTATACCCG 


GAAGGATGGG 


CAGCTAATCT 


550 . 




ATACCCCATT 


CACAGAAGAT 


ACCGAGACTG 


TGGGCCAGAG 


AGCCCTGCAC 


600 




TCAATTCTGA 


ATGCTGCCAT 


CATGATCAGT 


GTCATTGTTG 


TCATGACTAT 


650 


20 


CCTCCTGGTG 


GTTCTGTATA 


AATACAGGTG 


CTATAAGGTC 


•ATCCATGCCT 


700 




GGCTTATTAT 


ATCATCTCTA 


TTGTTGCTGT 


TCTTTTTTTC 


ATTCATTTAC 


750 




TTGGGGGAAG 


TGTTTAAAAC 


CTATAACGTT 


GCTGTGGACT 


ACATTACTGT 


800 




TGCACTCCTG 


ATCTGGAATT 


TTGGTGTGGT 


GGGAATGATT 


TCCATTCACT 


850 


25 


GGAAAGGTCC 


ACTTCGACTC 


CAGCAGGCAT 


ATCTCATTAT 


GATTAGTGCC 


900 




CTCATGGCCC 


TGGTGTTTAT 


CAAGTACCTC 


CCTGAATGGA 


CTGCGTGGCT 


950 




CATCTTGGCT 


GTGATTTCAG 


TATATGATTT 


AGTGGCTGTT 


TTGTGTCCGA 


1000 




AAGGTCCACT 


TCGTATGCTG 


GTTGAAACAG 


CTCAGGAGAG 


AGATGAAACG 


1050 


30 


CTTTTTCCAG 


CTCTCATTTA 


CTCCTCAACA 


ATGGTGTGGT 


TGGTGAATAT 


1100 




GGCAGAAGGA 


GACCCGGAAG 


CTCAAAGGAG 


AGTATCCAAA 


AATTCCAAGT 


1150 




ATAATGCAGA 


AAGCACAGAA 


AGGGAGTCAG 


AAGACACTGT 


TGCAGAGAAT 


1200 


35 


GATGATGGCG 


GGTTCAGTGA 


GGAATGGGAA 


GCCCAGAGGG 


ACAGTCATCT 


1250 


AGGGCCTCAT 


CGCTCTACAC 


CTGAGTCACG 


AGCTGCTGTC 


CAGGAACTTT 


1300 




CCAGCAGTAT 


CCTCGCTGGT 


GAAGACCCAG 


AGGAAAGGGG 


AGTAAAACTT 


1350 




GGATTGGGAG 


ATTTCATTTT 


CTACAGTGTT 


CTGGTTGGTA 


AAGCCTCAGC 


1400 


40 


AACAGCCAGT 


GGAGACTGGA 


ACACAACCAT 


AGCCTGTTTC 


GTAGCCATAT 


1450 


TAATTGGTTT 


GTGCCTTACA 


TTATTACTCC 


TTGCCATTTT 


CAAGAAAGCA 


1500 




TTGCCAGCTC 


TTCCAATCTC 


CATCACCTTT 


GGGCTTGTTT 


TCTACTTTGC 


1550 




CACAGATTAT 


CTTGTACAGC 


CTTTTATGGA 


CCAATTAGCA 


TTCCATCAAT 


1600 


45 


TTTATATCTA 


GCATATTTGC 


GGTTAGAATC 


CCATGGATGT 


TTCTTCTTTG 


1650 


ACTATAACAA 


AATCTGGGGA 


GGACAAAGGT 


GRTTTTCCTG 


TGTCCCACAT 


1700 




CTAACAAAGT 


CAAGATTCCC 


GKCTGGACTT 


TTGCAGCTTC 


CTKCCAAGTC 


1750 




TTCCTGACCA 


CCTTGCACTW 


TTGGACTTTG 


GARGGAGGTG 


CCTAKAGAAA 


1800 


50 


ACGRTTTTGA 


MCATACTTCA 


TCGCAGTGGA 


CTGTGTCCCT 


CGGTGCAGAA 


1850 




ACTACCAGAT 


TTGAGGGACG 


AGGTCAAGGA 


GATATGATAG 


GCCCGGAAGT 


1900 




TGCTGTGCCC 


ATCA 
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(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTER! ST ICS: 

(A) LENGTH: 463 

(B) TYPE; Amino Acid 
(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

MET THR GLU LEU PRO ALA PRO LEU SER TYR ?KE GLN ASN ALA GLN 

1 5 10 15 

MET SER GLU ASP ASN HIS LEU SER ASN THR ASN ASP ASN ARG GLU 

20 25 30 

ARG GLN GLU HIS ASN ASP ARG ARG SER LEU GLY HIS PRO GLU PRO 

35 40 45 

LEU SER ASN GLY ARC PRO GLN GLY ASN SER AP.G GLN VAL VAL GLU 

50 55 60 

GLN ASP GLU GLU GLU ASP GLU GLU LEU THR LEU LYS TYR GLY ALA 

65 70 75 

LYS HIS VAL ILE MET LEU PHE VAL PRO VAL THR LEU CYS MET VAL 

80 85 90 

VAL VAL VAL ALA THR ILE LYS SER VAL SER PHE TYR THR ARG LYS 

95 100 105 

ASP GLY GLN LEU ILE TYR THR PRO PHE THR GLU ASP THR GLU THR 

110 115 120 

VAL GLY GLN ARG ALA LEU HIS SER ILE LEU ASN ALA ALA ILE MET 

125 130 135 

ILE SER VAL ILE VAL VAL MET THR ILE LEU LEU VAL VAL LEU TYR 

140 145 150 

SO LYS TYR ARG CYS TYR LYS VAL ILE HIS ALA TRP LEU ILE ILE SER 

155 160 165 

SER LEU LEU LEU LEU PHE GLU GLU SER PHE ILE TYR LEU GLY GLU 

55 1 70 1 75 1 80 
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VAL PHE LYS TKR TYR ASN VAL ALA VAL ASP TYR ILE THR VAL ALA 
185 190 195 

LEU L£U ILE TRP ASN PHE GLY VAL VAL GLY MET ILE SER ILE HIS 
200 205 210 

TRP LYS GLY PRO LEU ARG LEU GLN GLN ALA TYR LEU ILE MET ILE 
215 220 225 

SER ALA LEU MET ALA LEU VAL PHE ILE LYS TYR LEU PRO GLU TRP 
230 235 240 

THR ALA TRP LEU ILE LEU ALA VAL ILE SER VAL TYR ASP LEU VAL 
245 250 255 

ALA VAL LEU CYS PRO LYS GLY PRO LEU ARG MET LEU VAL GLU THR 
260 265 270 

ALA GLN GLU ARG ASP GLU THR LEU PHE PRO ALA LEU ILE TYR SER 
275 280 285 

SER THR MET VAL TRP LEU VAL ASN MET ALA GLU GLY ASP PRO GLU 
290 295 300 

ALA GLN ARG ARG VAL SER LYS ASN SER LYS TYR ASN ALA GLU SER 
305 310 315 

THR GLU ARG GLU SER GLN ASP THR VAL ALA GLU ASN ASP ASP GLY 
320 325 330 

GLY PHE SER GLU GLU TRP GLU ALA GLN ARG ASP SER HIS LEU GLY 
335 340 345 

PRO HIS ARG SER THR PRO GLU SER ARG ALA ALA VAL GLN GLU LEU 
350 355 360 

SER SER SER ILE LEU ALA GLY GLU ASP PRO GLU GLU ARG GLY VAL 
365 370 375 

LYS LEU GLY LEU GLY ASP PHE ILE PHE TYR SER VAL LEU VAL GLY 
380 385 390 

LYS ALA SER ALA THR ALA SER GLY ASP TRP ASN THR THR ILE ALA 
395 400 405 
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TO 
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20 



40 



45 



SO 



55 



CYS PHE VAL ALA ILE LEU ILE GLY LEU CYS LEU THR LEU LEU LEU 

410 415 420 

LEU ALA ILE PHE LYS LYS ALA LEU PRO ALA LEU PRO ILE SER ILE 

425 430 435 

THR PHE GLY LEU VAL PHE TYR PHE ALA THR ASP TYR LEU VAL GLN 

440 445 450 

PRO PHE MET ASP GLN LEU ALA PHE HIS GLN PHE TYR ILE 

455 460 



(2) INFORMATION FOR SEQ ID NO: 3: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 30 base pairs 
25 (B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

30 (ii) MOLECULE TYPE: Other 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 
GCACTCAATT CTGAATGCTG CCATCATGAT 

(2) INFORMATION FOR SEQ ID NO : 4: 



(ij SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Other 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 
■AGCAATACTG TACGTAGCCA GAATGACAAT 

(2) INFORMATION FOR SEQ ID NO : 5: 
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(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 29 base pairs 
fBJ TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Other 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO 
CACCTGAGCA ATACWATGAC AATAGAGAA 
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FIGURE iA 
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II 20 29 38 47 56 

5' CCG TAG GTA GCC GCG GCG GCA GCG GGG CGG CGG GGA AGO GTA TGC ATA CAA ATT 



65 74 83 92 101 110 

TAT TAG CAT GCA GAG TGG GAG AAC CAC AAG ACC TWv TCT GGG AGC CTG CAA GTG 



119 128 137 146 155 160 

ACA ACA GCC TTT GCG G7C CTT AGA CAG CTT GGC CTG GAG GAG AAC ACA TGA AAG 



313 182 191 200 209 210 

AAA GAA CCT CAA GAG CCT TTC TTT TCT GTG AAA CAG TAT TTC TAT ACA OTT GCT 



227 23G 205 2S4 263 272 

CCA ATG ACA GAG TTA CCT GCA CCG TTG TCC TAG TTC CAG AAT GCA CAG ATG TCT 

MTELPAPLSYPONAQMS 

281 290 299 308 317 326 

GAG GAC AAC CAC CTG AGC AAT ACT AAT GAC AAT AGA GJ^. CGG CAG -GAG CAC AJ».C 

335 3U 353 362 371 380 

GAC AGA CGG AGC CTT GGC CAC CCT GAG CCA TTA TCT AAT GGA CGA CCC CAG GGT 

DRRS LGHP EPLSNG RPQG 

389 398 407 416 425 434 

AAC TCC CGG CAG GTG GTG GAG CAA GAT GAG GAA GAA GAT GAG GAG CTG ACA TTC 

NSRQVV EQDEEED. EELTL 

443 452 451 470 479 488 

AAA TAT GGC GCC AAG CAl' GTG ATC ATG CTC TTT GTC CCT GTG ACT CTC TGC ATG 

KVGAKHVIMtrvpVT LCM 

497 506 515 524 533 542 

GTG GTG GTC GTG GCT ACC ATT AAG TCA GTC ACC TTT TAT ACC CGG AAG GAT GCG 

VVV 'VATIKSVSFYTRKDG 
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FIGURE IB 

551 560 569 570 507 596 

CAC CTA ATC TAT ACC CCA TTC ACA CJPJ^ GAT ACC GAG ACT GIG GGC CAG AGA GCC 

QLIYTPFTEDTETVGQPA 

605 614 623 632 641 650 

CTG CAC TCA ATT CTG AAT GCT GCC ATC ATG ATC AGT CTC ATT GTT GTC ATG ACT 

LHSILNAA^MISVIVV^^T 

659 668 677 606 695 704 

ATC CTC CTG GTC GTT CTG TAT /vAA TAG ACG TGC TAT AAG GTC ATC CAT GCC TGG 

11, LVVLYK^RCYKVIHAW 

713 722 731 740 749 758 

CTT ATT ATA TCA TCT CTA TTG TTG CTG TTC TTT TTT TCA TTC ATT TAC TTG GGG 

** *" ~ ^ 
LIISSLLLLF£FSriYL-G 

767 775 705 794 803 612 

GAA GTG TTT AAA ACC TAT AAC GVT GCT GTG GAC TAC ATT ACT GTT GCA CTC CTC 

LV rKTYWVAVDYl TVALL 

S21 B30 839 849 8^37 866 

ATC TGG AAT TTT GGT GTG GTG GGA ATG ATT TCC ATT CAC TGG AAA GCT CCA CTT 

jWNFGVVGMISIHWKGi>l 

875 8 8 0 893 902 91 3 ' 920 

CGA CTC CAG CAG GCA TAT CTC ATT ATG ATT AGT GCC CTC ATG GCC CTG GTG TTT 

R L Q Q A Y L"-J H I S.A L M A L'V T 

929 938 947 958 965 974 

ATC A.^G TAC CTC CCT GAA TGG ACT GCG TGG CTC ATC TTG GCT GTG ATT TCA GTA 

IKYLPEWTAWLILAVJSV 

983 992 1001 1010. 1019 1028 

TAT GAT TTA GTG GCT CTT TTG TGT CCG AAA GGT CCA CTT CGT ATG CTG GTT GAA 

YDLVAVLCPKGPLRMLVE 

1037 1046 1055 1064 J 073 1082 

ACA GCr CAG GAG AGA GAT GAA ACG CTT TTT CCA GCT CTC ATT TAC TCC TCA ACA 

TAQRBDETLFPALI YSST 

1091 1100 1109 1118 1127 1136 

ATG GTG TGG *TTG GTG AAT ATG GCA GAA GGA GAC CCG GAA GCT CAA AGG AGA GTA 
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hV WLVNMA EGDPiAQRRV 

5 

ll<,5 115^ 11 63 1172 liei 1190 

TCC AAA AAT TCC AAG TAT AAT GCA CAA ACC ACA C7vA AGG GAG TCA C.\A GAC ACT 



SK^^SKY1JA£STER£S0DT 

1199 1208 121:' 1226 3235- 12^4 

GTT GCA GAG AAT GAT GAT GGC GGG TTC ACT GAG G;iA TGG GAA CCC GAG AGG GAC 



15 

B.203 1262 1271 12ff0 1289 1298 

AGT CAT CTA GGG OCT CAT CGC TCT ACA COT CvAG TCA CGA GCT GCT GTC CAG GKk 



SHLCPHRSTPESRAAVQE 

20 

\307 1316 1325 1334 1343 L352 

CTT TCC AGO AGT ATC CTC GCT GGT GAA GAC CCA GAG GAA AGG GGA GTA AAA CTT 



LS SSILAGEDPEERGVKL 

25 

(L361 1370 1379 1388 1397 1 406 

GGA TTG GGA CAT TTC ATT TTC TAC AGT GTT CTC GTT GGT AAA GCC TCA CCA ^CA 



GLGOri rVSVLVGKASAT 

30 

1415 1^24 1433 1442 1451 1460 

GCC AGT =GGA GAC TGG AAC ACA ACC ATA GCC TGT TTC GTA GCC ATA TTA ATT GGT 



AS G D WNTT IACrV A I*L I G 

35 

a469 1478 1487 1496 1505 1514 

TTG TGC CTT ACA TTA TTA CTC CTT GCC ATT TTC AAC AAA GCA TTG CCA GCT CTT 



LC L T L L L J, AI FK KA I. P A L 

40 

1523 1532 IS^i 1550 1559 1568 

CCA ATC TCC ATC ACC TTT GGG CTT GTT TTC TAC TTT GCC ACA GAT TAT CTT GTA 



PISITTG J. v'FYFATDyifV 

45 

1511 1586 1595 1604 1613 1622 

CAG CCr TTT ATG GAC CAA TTA GCA TTC CAT CAA TTT TAT ATC TAG CAT ATT TGC 



QPrMDQLATHQFYI 

50 

1631 1640 1649 1659 1667 1676 

GGT TAG AAT CCC ATG GAT GTT TCT TCT TTC ACT ATA ACA /lAA TCT GGC GAG GAC 
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FIGURE ID 



!fiB5 16^4 1703 171 2 jTCl i/JU 

AAA OCT GRV TTT CCT GTG TCC CAC ATC 7AA CAA ACT c;u\ GAT TCC CGK CTG GAG 



1739 17<B 1757 1 7 66 1775 1784 

TTT TGC AGC TTC CTK CCA AGT CTT CCT GAC CAC CTT GCA CTW TTG GAC TTT GGA 



a795 1 802 18 1 1 1 02O 1 829 183'8 

HGG AGG TGC CTA KAG /wAA ACG RTT TTG AMC ATA CTT CAT CGC AGT GGA CTG TGT 



1847 1656 1865 1974 1883 1892 

CCC rCG CTG CAG AAA CTA CCA GAT TTG AGG GAC GAG GTC AAG GAG ATA TCA TAG 



1901 1910 
GCC CGG /vAG TTG CTG TGC CCA TCA 3 ' 



Figure 4 

PS-1 Oligonucleotide Probes 



Probe Sense Sequence Bases * 



PS-i-both 5'-GCACTCAATTCTGAATGCTGCCATCATGAT-3* 638-667 24 50 

SEQIDNO: 3 

PS-l-long 5-'Ar,CAATACT GTACGTAGCCAG AATGACAAT-3' 315-344 23 49 

SEQIDNO: 4 

PS-i-shocT 5'.CACCTGAGCAATACT/AATGACAATAGAGAA-3' 309-323 and 336-350 22 47 
SEQ ID NO: 5 



50 *Refer3 to EMBL and GenBank entry HUMSI82R (accession number: L421 10); Sherrington et al 

1995» Nature 375:754-760. Ti represents the hybridization temperature (incubation) and Tw 
represents the wash temperature. The underlined bases code for the amino acids V, R, S and Q. 
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Figure 5. 

Quantification of the ISH signal for PS- 1 -long and PS- 1 -short mRNAs in human brain. 



Brain region 


Case 


PS-Mong(n) 


PS- 1 -short (n) 


Hippocampus 


Control 


0.025 ±0.014(2) 


0.023 (1) 




AD 


0.035 ± 0.007 (3) 


0.026 + 0.01 (3) 




FAD 


0.008 ±0,001 (3)* 


0.030 ± 0.004 (3) 


Frontal cortex 


AD 


0.024 ± 0.005 (3) 


0.042 ±0.014 (3) 




FAD 


0.012 ±0.0 (3)** 


0.022 ±0.011 (3) 


Cerebellum 


Control 


0.036 (1) 


0.013 (1) 




AD 


0.024 ± 0.007 (3) 


0.019 ±0.006 (3) 




FAD 


0.012 ± 0.002 (2) 


0.014 ± 0.005 (2) 


Temporal cortex 


FAD 


0.014 ± 0.009 (3) 


0.015 ±0.01 (3) 


Visual Cortex 


FAD 


0.016 ±0.007 (3) 


0.032 ±0.00 1 (3) 



35 



values represent means ± s.d.; units are arbitrary (machine grey levels). *FAD vs AD p = 
0.003; "FAD vs AD p = 0.014; Student's t-test. 



^5 Claims 

1. A method of identifying an individual susceptible to a neurological disease comprising: 

providing a sample of genetic material from an individual susceptible to a neurological disease; and 
50 detecting the presence of an alternative splice site comprising the sequence VRXQ, wherein V is valine, R is 

arginine, X is any hydrophobic amino acid residue and Q is glutamine, in a polyadenylated messenger RNA 
transcript or protein encoded therefrom in the sample of genetic material. 



2. The method of claim 1 wherein the sequence VRXQ is detected using selected oligonucleotide probes comprising 
55 anticodon sequences associated with the sequence VRXQ having degenerate positions at the third base position. 

3. The method of claim 2 further comprising associating said oligonucleotides with codon sequences and detecting 
cDNA. 
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4. The method of claim 1 wherein the sequence VRXQ is detected using an antibody against a polypeptide comprising 
the sequence VRXQ. 

5. The method of claim 1 wherein the neurological disease comprises Alzheimer's Disease and the mRNA or protein 
is encoded by the presenilin 1 gene. 

6. The method of claim 5 therein the sequence comprises a 4 amino acid insertion between codons 26 and 27 of the 
gene and the sequence VRSQ. 

7. The method of claim 1 wherein the mRNA or protein is encoded by the gamma-Aminobutyric acid A receptor gene 
and the sequence comprises VREQ. 

8. The method of claim 1 wherein the mRNA or protein is encoded by the tyrosine hydroxylase gene and the sequence 
comprises VRGQ. 



9. A method for diagnosing a neurological disease comprising determining the levels of polyadenylated messenger 
RNA transcripts or proteins encoded therefrom comprising the sequence VRXQ wherein V is valine, R is arginine/ 
X is any hydrophobic amino acid residue and Q is g!utamine= in a sample of genetic material and comparing these 
levels with established controls. 



10. The method of claim 9 wherein the neurological disease comprises Alzheimer's Disease and the mRNA or protein 
is encodes by the presenilin 1 gene. 

11. The method of claim 10 wherein the sequence comprises a 4 amino acid insertion between codons 26 and 27 of 
^5 the gene and the sequence VRSQ. 

1 2. The method of claim 9 wherein the mRNA or protein is encoded by the gamma-Aminobutyric acid A receptor gene 
and the sequence comprises VREQ. 

20 1 3. The method of claim 9 wherein the mRNA or protein is encoded by the tyrosine hydroxylase gene and the sequence 



40 
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50 
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comprises VRGQ. 
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